# Conversation Optimization

Google Gemma 3 12b It Qat GGUF

Gemma 3 12B instruction-tuned model with weights quantized through Google's QAT (Quantization-Aware Training), offered in multiple quantized versions to suit different hardware requirements.

Large Language Model

Orpheus 3b FT Q4 K M.gguf

Orpheus is a high-performance text-to-speech model, fine-tuned for natural and emotionally rich speech synthesis. This repository hosts the Q4_K_M (4-bit) quantized version of the 3-billion-parameter model, which improves runtime efficiency while maintaining high-quality output.

Speech Synthesis Supports Multiple Languages

Mlabonne Gemma 3 27b It Abliterated GGUF

A quantized version based on Google's Gemma 3 27B model, produced with llama.cpp at multiple quantization levels and suited to text generation tasks.

Large Language Model

Llama 2 7b Chat Hf Q4 K M GGUF

GGUF-quantized version of the 7B-parameter chat model from Meta's Llama 2 series, suitable for local deployment and inference.

Large Language Model English

Llama 2 is a 70-billion-parameter, conversation-optimized large language model developed by Meta. It surpasses most open-source dialogue models in benchmark tests, with safety comparable to mainstream proprietary models.

Large Language Model

Transformers English

TheCraftySlayer

Featured Recommended AI Models

AIbase

© 2025 AIbase